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We present a study of the capability of a 500 GeV e"'"e~ collider based on CLIC tech- 
nology for precision measurements of top quark properties. The analysis is based on 
full detector simulations of the CLICJLD detector concept using Geant4, including 
realistic background contributions from two photon processes. Event reconstruction is 
performed using a particle flow algorithm with stringent cuts to control the influence 
of background. The mass and width of the top quark are studied in fuUy-hadronic 
and semi-leptonic decays of tt pairs using event samples of signal and standard model 
background processes corresponding to an integrated luminosity of 100 fb~^. Statis- 
tical uncertainties of the top mass given by the invariant mass of its decay products 
of 0.08 GeV and 0.09 GeV are obtained for the fuUy-hadronic and the semi-leptonic 
decay channel, respectively, demonstrating that similar precision to that at ILC can be 
achieved at CLIC despite less favorable experimental conditions. 



1 Introduction 

The top quark plays a unique role in particle physics. Due to its high mass, it is particu- 
larly sensitive to new physics and is intimately connected to the mechanism of electroweak 
symmetry breaking. It has sizable impact on the Higgs boson mass through radiative cor- 
rections, and, together with the W boson mass, drives electroweak predictions for the Higgs 
mass. Due to its short lifetime, the top quark decays before hadronizing, offering the unique 
opportunity to study a bare quark by accessing its properties directly through its decay 
products. Top quarks decay electroweakly, into a real W boson and a down-type quark. 
Due to the large bt CKM matrix element, the decay is almost exclusively into a W boson 
and a b quark. 

To date, top quarks have been observed at the Tevatron and at the LHC. At present, 
the best measurement of the mass is provided by the Tevatron, with a statistical error of 
0.6 GeV m. The measurement is already systematically limited, with a total systematic error 
of 0.75 GeV. Early LHC analyses obtained statistical errors on the order of 1 GeV to 2 GeV, 



with systematic errors close to 3 GeV 2|3 . With increasing integrated luminosity, significant 
improvement is anticipated, but the systematics are expected to remain substantial due to 
the challenging environment of hadron colliders and due to theoretical uncertainties ^ . 

Significant improvements are expected in e'^e~ collisions, which provide a cleaner exper- 
imental environment. The theoretically cleanest way of measuring the top quark mass is by 
means of a threshold scan, since such measurements can be linked directly to theoretically 
meaningful mass definitions. Studies suggest that combined theoretical and experimental 
errors of significantly below 100 MeV can be achieved with this technique |5 . A drawback of 
this technique is that a threshold scan requires the collider to be operated for a single mea- 
surement over an extended period, conflicting with other studies to be performed at such a 
machine. It is thus attractive to explore the possibilities for top property measurements in tt 
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production well above threshold, by performing the top reconstruction from its decay prod- 
ucts, the same technique as used in hadron colliders. Here, the theoretical interpretation of 
the observations is more challenging, but progress has been made recently in establishing 
connections between the top mass parameter used in theory and the experimentally observ- 
able invariant mass of the decay products [6||7]. In the following, the invariant mass of the 
top decay products will be referred to as the top mass. 

The relatively clean environment of e+e~ collisions, combined with the expected jet en- 
ergy and track momentum resolution of linear collider detectors, makes precision measure- 
ments in fuUy-hadronic (e^e~ — > — qqbqqb) and semi-leptonic (e"'"e~ ti ^ qqblvh) 
decay channels, the channels with the highest branching fraction, possible. For the ILC, stud- 
ies with full detector simulations have shown that statistical errors on the level of 100 MeV 
can be achieved for integrated luminosities of 100 fb~^ at yfs = 500 GeV Is)^. 



In the framework of the CLIC Conceptual Design Report 10 , top quark pair produc- 
tion was studied as a benchmark to evaluate the physics performance for processes with 
multi-fermion final states at 500 GeV, including the performance of flavor tagging in the 
CLIC environment. This process also provides the possibility for a direct comparison to the 
detector performance studies performed for the ILC detector letter of intents [slIO] . 



2 Experimental Conditions at a 500 GeV CLIC Collider 

The Compact Linear Collider CLIC [TT] is a collider concept based on normal conducting 
accelerating cavities and two-beam acceleration, which is designed to provide up to 3 TeV 
collision energy. In a staged approach, a shorter, lower energy version would be operated 
initially, while construction is under way for the full energy phase. 

Here, we study the case of a 500 GeV CLIC machine, which is directly comparable to 
the baseline design of the International Linear Collider. The use of a different acceleration 
technology leads to differences in the experimental environment which could potentially 
have a negative impact on the physics performance. The most important difference between 
ILC and CLIC in that respect is the time between bunch crossings within a bunch train, 
which is 0.5 ns in the case of CLIC, while it is 356 ns or 670 ns in the case of the ILC, 
depending on the adopted design. For typical detector integration times of the order of a 
few to 100 ns, the short bunch crossing time leads to the pile-up of background from many 
bunch crossings over the 177 ns long bunch trains, which is not present at ILC. In addition, 
the tighter focusing at CLIC results in increased beamstrahlung and correspondingly larger 
energy spread, with ^61% of the total luminosity within 1% around the peak, compared to 
~72% at the ILC. This translates into larger uncertainties when using energy or momentum 
constraints along the beam axis. 

The radiated photons lead to background through the creation of coherent and incoherent 
e^e~ pairs as well as incoherent quark pair production, which results in hadronic events. 
While the coherent pairs are emitted at very small angles, defining the crossing angle of the 
machine, the incoherent pairs have higher transverse momenta and constrain the dimension 
of the beam pipe in the experiment as well as the radius of the vertex detector. The 
hadron background affects all aspects of the event reconstruction, in particular jet energy 
measurements. At a 500 GeV CLIC machine, 0.3 77 — >■ hadrons events per bunch crossing 
are expected, with an energy of 13.3 GeV. 3.4 GeV of energy are deposited in the calorimeter 
system, 0.2 GeV out of this in the barrel detectors. 
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The detector model used in the present study is a variant of CLICJLD 12 , a detector 
concept based on Particle Flow event reconstruction. It consists of a low mass, high precision 
vertex detector and an inner silicon tracker, surrounded by a large-volume time projection 
chamber, followed by highly granular electromagnetic and hadronic calorimeters contained 
inside a 4 T solenoidal magnet with instrumented flux return for muon identification. The 
detector design is based on the ILD detector concept 8 for the ILC, adapted to account for 



the higher energy and more severe background conditions at CLIC 10 . This leads to an 
increased radius of the innermost layer of the vertex detector, which sits at 31 mm compared 
to 16 mm in ILD at the ILC. At 500 GeV, the background is significantly reduced compared 
to the 3 TeV case, permitting modifications of the detector to optimize its performance for 
the lower collision energy. In particular the innermost vertex detector layer for CLICJLD 
can move in by 6 mm to a radius of 25 mm, improving flavor tagging at low momentum. 

Minimization of the impact of the hadronic background requires strict timing cuts on 
the reconstructed particles in the particle flow event reconstruction to limit the influence of 
out-of-time contributions. Here, timing in the calorimeters is of particular importance. 

3 Event generation, simulation and reconstruction 

The signal process e+e^ — > tt, has a cross section of approximately 530 fb at a 500 GeV 
CLIC collider. The top quark decays almost exclusively into a W boson and a b quark. The 
signal events can thus be grouped into different classes, according to the decay of the W 
bosons. These are the fully-hadronic channel, with both W^s decaying into quark pairs, the 
semi-leptonic channel, with one W decaying into quarks, the other into a lepton and the 
corresponding neutrino, and the fully-leptonic channel, with both Ws decaying into lepton 
and neutrino. In the leptonic channels, the decay into a r and a neutrino is a special case, 
since the r decays almost instantly into either a lepton and a neutrino or into one or more 
hadrons and a neutrino, giving rise to additional missing energy in the final state, and 
potential confusion with hadronic decay modes. 

In the present analysis, only fully-hadronic and semi-leptonic events, excluding t final 
states, were selected, since those provide the best possible mass measurement. However, 
to account for imperfect event classification, all possible decay modes of the ti pair were 
generated according to their respective branching fractions. The top mass and width, as 
defined in the event generator, were fixed for the signal event sample to mj = 174.0 GeV 
and at = 1.37 GeV. 

In addition to the signal, background processes with similar event topologies have to be 
considered. These are mostly four and six fermion final states, with the high cross-section 
quark pair production in addition. Beyond this, the processes qqe~^e~ and qqev^ which 
are dominated by t-channel single boson production, were investigated using samples with 
reduced statistics. It was shown that the non-di-boson contributions are rejected completely 
in the analysis. Since the di-boson contributions are accounted for in the e+e^ — > WW 
and e+e^ — > ZZ modes, the processes qqe^e^ and qqev were not considered in the final 
production. 



Since WHIZARD 1.95 13 , which was used as the default event generator for the CLIC 
CDR benchmark studies flO', is not accurately reproducing final-states and explicitly defined 
intermediate states with particles with non-zero width, PYTHIA MJ was used to generate 
the signal process e+e" — tt as well as the two background processes e+e" — )• WW and 
e~'"e" — > ZZ. The processes with explicitly given final states, without specifying intermediate 
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y/s = 500 GcV, CLIC beam energy spectrum 



process type 


e+e~ -?■ 


cross section a 


event generator 


Signal (TOt = 174 GeV) 


tt 


528 fb 


PYTHIA 


Background 


WW 


7.1pb 


PYTHIA 


Background 


ZZ 


410 fb 


PYTHIA 


Background 




2.6 pb 


WHIZARD 


Background 


wwz 


40 fb 


WHIZARD 



Table 1: Signal and considered background processes, with cross sections calculated for 
CLIC500. 



particles, e+e~ — >■ qq, e+e" — )■ qqe~^e~ and e+e" — >■ qqeu were generated with WHIZARD 
1.95. Since the process e+e~ -J> WWZ is not implemented in PYTHIA, WHIZARD was 
used for its generation. For simplicity, these events were generated with zero width for the 
intermediate bosons, allowing to specify defined intermediate states in WHIZARD, keeping 
integration times short. Table [T] summarizes the studied processes and the corresponding 
event generators, with approximate cross sections at a 500 GeV CLIC machine. 

All generated events arc fully simulated in a detailed detailed GEANT4 [15 model of the 
CLICTLD detector introduced above. Before reconstruction, all events are overlayed with 
77 -> hadrons background events corresponding to 300 bunch crossings, the equivalent of a 
full bunch train. In the particle flow event reconstruction with the PandoraPFA algorithm 



16 , pt dependent timing cuts are applied to reduce the impact of this background on the 
signal. To keep the loss of signal particles in the 500 GeV events, which are characterized 
be relatively low energy particles, at a minimum, the timing cuts are considerably relaxed 
compared to the standard CLIC 3 TeV event reconstruction. Due to the significantly lower 
background multiplicity, a clean reconstruction of the signal processes is still possible. 



4 Analysis and Results 

The goal of the analysis is to provide a high-purity tt event sample with well reconstructed 
events to achieve the best possible measurement of the invariant mass of the top quark decay 
products. It favors strict rejection of imperfectly reconstructed events over the maximization 
of reconstructed top quark candidates. In general, the present analysis scheme is similar to 
the tt analysis performed for the ILD Letter of Intent. Due to a more general input sample 
in the present study, which includes semi-leptonic r events as well as fuUy-leptonic events, 
and due to the different bunch and beam structure of the CLIC machine, some additional 
analysis steps had to be introduced, while major strategy changes had to be adopted for 
other parts of the analysis. 

The first step of the analysis is the classification of the events into three branches ac- 
cording to the number of isolated leptons (electron or muon) identified in the event. Events 
without isolated leptons are classified as fuUy-hadronic, events with one or more leptons 
are classified as semi-leptonic and fuUy-leptonic, respectively. The fuUy-leptonic events are 
rejected, while events in the other two classes are clustered into jets for the subsequent 
analysis, using an exclusive kt algorithm with A77, A0 metric as implemented in the FastJet 



package 17 and a jet size parameter of i? = 1.3, selected to account for the rather large jets 



at relatively low energy. Fully-hadronic event candidates are clustered into six jets, while 
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semi-leptonic events are clustered into four jets, with the identified isolated lepton excluded 
from jet-finding. 

Following this, flavor tagging of the found jets is performed using the LCFI Flavour 
Tagging package. It is based on a neural network, which provides b and c jet probabilities 
("b-tag") for each jet in the event. The tagging is crucial for the correct assignment of jets to 
top candidates in tt — >■ (hqq)(hqq) and ti — > {bqq){blvi) events and provides discrimination of 
signal events and multi-fermion background. To construct the final state, the two jets with 
the highest b-tag are identified as the two b-quarks from the decay of the two top quarks. 
In the semi-leptonic case, one of the W bosons is constructed from the two remaining jets, 
the other from the isolated lepton and the neutrino, measured via missing momentum. In 
the all-hadronic case, a three-fold ambiguity exists in the assignment of light-flavor jets to 
W bosons. Here, the combination with the smallest overall difference of the invariant mass 
of the two W candidates from the true W mass is selected. 

After the identification of 6 jets and the pairing of light jets and leptons into W bosons, 
the next step is the grouping of W candidates and 6 jets into top quarks. This assignment 
is performed with a kinematic fit. Out of the two possible combinations, the one with the 
higher probability of the kinematic fit result is chosen as the correct combination of W 
bosons and 6 jets into top candidates. The kinematic fit, here taken from the MarlinKinFit 



package 18 , uses kinematic constraints assuming a tt event to improve the precision of 
the event parameters of interest, in the present case the invariant mass of the top quark 
candidate. 

In this analysis, the input parameters to the kinematic fit are the four-momenta of the 
light jets, already paired into W bosons in the case of the 6-jet sample, the momenta of 
the two h jets, and that of the isolated lepton for semi-leptonic events. In the latter case, 
the unmeasured neutrino represented an invisible particle in the fit, with starting values 
set to the measured missing energy and momentum in the event. In the fit, the energy 
of the system is constrained to the nominal center of mass energy, not accounting for the 
beam energy spectrum. This leads to the rejection of events with particularly low center 
of mass energy, but overall significantly improves the reconstruction of the invariant mass. 
Additional constraints are momentum conservation, accounting for the beam crossing angle, 
the mass of the intermediate W bosons, and the requirement for an equal mass of the t and 
the t candidate. 

During the fit procedure, the fitter varies the particle momenta and energies to fulfill 
the constraints. This is done within the specified detector resolution for the various input 
particles, both in energy and azimuthal and polar angle. The angular resolutions for jets and 
angular and energy resolutions for leptons is derived from Monte Carlo studies of a tt sample, 
while the energy resolution for jets is taken from the single jet performance of PandoraPFA, 
in order to avoid an overestimation of the error due to correlated effects between several jets 
originating from the same object. 

The fit fails if it is unable to satisfy the constraints outlined above within the allowed 
modifications of the input parameters. It is observed that some of the fit failures are due 
to the wrong identification of one of the b jets. This is particularly likely in the case of a 
W decaying into a charm quark and another light quark. Thus, to improve the number of 
successful fits and to account for possible wrong flavor tagging, the kinematic fit is repeated 
for unsuccessful kinematic fits after exchanging the b-jet with the lower b-tag value with 
the light-jet with the highest b-tag value. This procedure increases the number of successful 
kinematic fits by ^ 20%. 
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Figure 1: Reconstructed invariant mass distribution for the 6-jet channel without (right) 
and with (left) kinematic fit for signal-only events. The black lines shown the top mass dis- 
tribution for events with overlayed 77 — hadrons events, the red line shows events without. 
The high-mass tail in the mass distribution obtained with kinematic fit is due to kinematic 
reflections in events with incorrect assignment of jets to top candidates. 



The result of the kinematic fit, compared to the top mass reconstruction without kine- 
matic fit, is shown in Figure [l] for the fully- hadronic signal-only event sample. Also shown 
is the comparison of the performance with and without the inclusion of 77 — > hadrons 
background, ft is apparent that the kinematic fit significantly improves the reconstructed 
invariant mass. In addition, it also reduces the impact of the of 77 — hadrons background. 

Beyond the improvement of the mass measurement, the kinematic fit also serves as 
an excellent tool for the rejection of non-tt background, as well as for the suppression of 
remaining contributions from wrongly classified all-leptonic decays and semi-leptonic events 
with r final states. Additional background rejection is provided by a selection algorithm 
based on a binned likelihood technique, using event shape, flavor tagging and reconstructed 
mass parameters. Together with the kinematic fit, the background contaminations are 
reduced to a level of 11% for the fuUy-hadronic and 5% for the semi-leptonic branch. The 
overall signal selection efficiencies for the complete analysis are 35% for fuUy-hadronic and 
56% for semi-leptonic events. The higher selection efficiency for semi-leptonic top pair decays 
is mainly due to the weaker constraints in the kinematic fit due to the missing energy and 
momentum of the neutrino. This results in a slightly broader invariant mass distribution. 

Figure [2] shows the final reconstructed distribution of the invariant mass of the top decay 
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Figure 2: Final invariant top mass distribution for 6-jet events (left) and 4-jet events (right). 
Black points with error bars indicated simulated data classified at signal events. The green 
hatched histogram shows the contribution of non tt background. The blue line indicates the 
fit of the top mass. 

products, together with contributions from non background, for the fuUy-hadronic and the 
semi-leptonic event sample. From these distributions, the top mass and width is determined 
with an un-binned maximum likelihood fit, which includes a Breit-Wigner distribution for 
the mass and width, convoluted with a detector resolution function, as well as a background 
contribution and a component accounting for incorrectly reconstructed events ending up 
in the high-mass region of the distribution. The detector resolution function is determined 
from a higher statistics fully simulated signal-only sample, and was cross-checked for different 
values of the top mass and width. 
The resulting top mass is 

mtop = 174.07 GeV ± 0.08 GeV(stat) 

for the all-hadronic sample, and 

mtop = 174.28 GeV ± 0.09 GeV(stat) 

for the semi-leptonic sample. The generated top mass is 174, GeV, thus the all-hadronic mass 
is in excellent agreement with the input value, while the semi-leptonic measurement differs 
by three standard deviations. This deviation is likely mostly due to uncertainties of the 
detector resolution function, which was determined from a statistically independent sample 
of approximately 2.5 times the integrated luminosity of the signal sample. It is expected 
that this can be improved with a more thorough study. 

Also the width of the top quark was extracted from the signal fit. For the all-hadronic 
sample, a width of 

atop = 1-33 GeV ± 0.21 GeV(stat) 
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was obtained, to be compared with a generator level value of 1.37,GeV. For the semi-leptonic 
sample, a width of 

atop = 1-55 GeV ± 0.26 GeV(stat) 
is found, also in good agreement with the generator value. 

5 Conclusions 

The mass of the top quark is one of the key parameters of the standard model, and provides 
sensitivity to new physics. The present study, using full simulations including machine and 
physics backgrounds, carried out in the framework of the CLIC CDR, shows that a 500 
GeV linear e+e^ collider based on CLIC technology is an excellent tool for precision top 
measurements. The 77 hadrons background can be controlled by timing cuts and by a 
suitable choice of the jet finder, and does not significantly affect the flavor tagging for top 
events. Precise reconstruction of the event kinematics is achieved by means of a kinematic 
fit, which also serves to control the energy uncertainty due to the beam energy spectrum and 
contributed to the rejection of non-tt background. With an integrated luminosity of 100 fb"""^, 
a statistical of precision of 80 MeV is achieved in the fuUy-hadronic decay channel, and a 
precision of 90 MeV was achieved in the semi-leptonic channel. This precision is comparable 
to that expected for the ILC, despite the more challenging experimental environment at 
CLIC, demonstrating the capabilities for precision measurements at a sub-TeV e+e^ collider 
based on CLIC technology. 
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